Multimodal Object Detection via Probabilistic Ensembling

Authors

Abstract

Object detection with multimodal inputs can improve many safety-critical systems such as autonomous vehicles (AVs). Motivated by AVs that operate in both day and night, we study multimodal object detection with RGB and thermal cameras, since the latter provides much stronger object signatures under poor illumination. We explore strategies for fusing information from different modalities. Our key contribution is a probabilistic ensembling technique, ProbEn, a simple non-learned method that fuses together detections from multi-modalities. We derive ProbEn from Bayes’ rule and first principles that assume conditional independence across modalities. Through probabilistic marginalization, ProbEn elegantly handles missing modalities when detectors do not fire on the same object. Importantly, ProbEn also notably improves multimodal detection even when the conditional independence assumption does not hold, e.g., fusing outputs from other fusion methods (both off-the-shelf and trained in-house). We validate ProbEn on two benchmarks containing both aligned (KAIST) and unaligned (FLIR) multimodal images, showing that ProbEn outperforms prior work by more than 13% in relative performance!
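The fusion rule the abstract describes can be illustrated with a short sketch. Under conditional independence, Bayes' rule gives p(y | x_1, ..., x_m) ∝ ∏ p(y | x_i) / p(y)^(m-1), and a modality whose detector did not fire is simply left out of the product. The code below is a minimal illustration of that rule, not the authors' implementation: the function name `fuse_posteriors`, the use of `None` to mark a missing modality, and the default uniform class prior are all assumptions made for this example, and box matching and box-coordinate fusion are not covered.

```python
import math


def fuse_posteriors(posteriors, prior=None):
    """Fuse per-modality class posteriors assuming conditional independence.

    posteriors: list with one entry per modality; each entry is a list of
                class probabilities summing to 1, or None if that modality's
                detector did not fire (hypothetical convention for this sketch).
    prior:      class prior p(y); defaults to uniform (an assumption here).
    """
    # Missing modalities are marginalized out by dropping them from the product.
    available = [p for p in posteriors if p is not None]
    if not available:
        raise ValueError("at least one modality must provide a detection")

    num_classes = len(available[0])
    if prior is None:
        prior = [1.0 / num_classes] * num_classes

    m = len(available)
    # p(y | x_1..x_m) is proportional to prod_i p(y | x_i) / p(y)^(m - 1)
    unnormalized = [
        math.prod(p[c] for p in available) / prior[c] ** (m - 1)
        for c in range(num_classes)
    ]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


# Two modalities (e.g. RGB and thermal) that agree on class 0.
both = fuse_posteriors([[0.7, 0.3], [0.8, 0.2]])

# Only one modality fired; the other is passed as None.
single = fuse_posteriors([[0.7, 0.3], None])
```

With a uniform prior, two agreeing detections reinforce each other: `both[0]` is about 0.90, higher than either input score, while `single` stays at the score of the one detector that fired.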


Similar resources

Producing Scores for Customers via Ensembling SVM

Supervised by Dr. Yilong Yin, School of Computer Science and Technology, Shandong University, Jinan 250100, China. Abstract: This report shows our solution to PAKDD Competition 2007. Following a brief description of the data mining task, we discuss four difficulties to be dealt with in this task. Then, we show how to do the data pre-processing. To weaken class-imbalance of th...

Multimodal Sparse Features for Object Detection

In this paper the sparse coding principle is employed for the representation of multimodal image data, i.e. image intensity and range. We estimate an image basis for frontal face images taken with a Time-of-Flight (TOF) camera to obtain a sparse representation of facial features, such as the nose. These features are then evaluated in an object detection scenario where we estimate the position of...

Scaling for Multimodal 3D Object Detection

We investigate two methods for scalable 3D object detection. We base our approach on a recently proposed template matching algorithm [5] for detecting 3D objects. First, we demonstrate that it is possible to gain a significant increase in runtime performance of the algorithm at almost no cost in accuracy by quickly rejecting most regions of the image with low-resolution templates. Second, we inve...

Probabilistic visual learning for object detection

We present an unsupervised technique for visual learning which is based on density estimation in high-dimensional spaces using an eigenspace decomposition. Two types of density estimates are derived for modeling the training data: a multivariate Gaussian (for a unimodal distribution) and a multivariate Mixture-of-Gaussians model (for multimodal distributions). These probability densities are th...

Object Detection Using Probabilistic Graph Model

We propose a Probabilistic Graph based Model for interactive image segmentation. A multilayer probabilistic Graph is constructed from an oversegmentation of the image to model the relationships among super pixel regions, edge segments and vertices. We used an iterative procedure to merge several regions based on the probability of the regions. Regions are merged until the user is satisfied with...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2022

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-20077-9_9